Emoticon Smoothed Language Models for Twitter Sentiment Analysis


Abstract

Twitter sentiment analysis (TSA) has become a hot research topic in recent years. The goal of this task is to discover the attitude or opinion of the tweets, which is typically formulated as a machine learning based text classification problem. Some methods use manually labeled data to train fully supervised models, while others use some noisy labels, such as emoticons and hashtags, for model training. In general, we can only get a limited number of training data for the fully supervised models because it is very labor-intensive and time-consuming to manually label the tweets. As for the models with noisy labels, it is hard for them to achieve satisfactory performance due to the noise in the labels, although it is easy to get a large amount of data for training. Hence, the best strategy is to utilize both manually labeled data and noisy labeled data for training. However, how to seamlessly integrate these two different kinds of data into the same learning framework is still a challenge. In this paper, we present a novel model, called emoticon smoothed language model (ESLAM), to handle this challenge. The basic idea is to train a language model based on the manually labeled data, and then use the noisy emoticon data for smoothing. Experiments on real data sets demonstrate that ESLAM can effectively integrate both kinds of data to outperform those methods using only one of them.
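The basic idea in the abstract, a language model trained on manually labeled data and smoothed with noisy emoticon data, can be illustrated with a small sketch. The abstract does not specify the model order or the smoothing formula, so the code below assumes per-class unigram models combined by linear interpolation with a weight alpha. The function names, the floor value, and the toy tweets are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter


def unigram_probs(docs):
    """Maximum-likelihood unigram probabilities from tokenized documents."""
    counts = Counter(tok for doc in docs for tok in doc)
    total = sum(counts.values())
    return {word: count / total for word, count in counts.items()}


def interpolate(p_manual, p_emoticon, alpha):
    """Assumed smoothing: alpha * P_manual(w|c) + (1 - alpha) * P_emoticon(w|c)."""
    words = set(p_manual) | set(p_emoticon)
    return {
        w: alpha * p_manual.get(w, 0.0) + (1 - alpha) * p_emoticon.get(w, 0.0)
        for w in words
    }


def train(manual, emoticon, alpha=0.5):
    """Build one smoothed unigram model per sentiment class."""
    return {
        label: interpolate(
            unigram_probs(manual[label]),
            unigram_probs(emoticon.get(label, [])),
            alpha,
        )
        for label in manual
    }


def classify(tokens, models, floor=1e-9):
    """Return the class whose smoothed model gives the highest log-likelihood."""
    # Tokens unseen in every class carry no evidence, so they are skipped.
    known = set().union(*models.values())
    scores = {
        label: sum(
            math.log(max(model.get(t, 0.0), floor)) for t in tokens if t in known
        )
        for label, model in models.items()
    }
    return max(scores, key=scores.get)


# Toy data (hypothetical): a few hand-labeled tweets plus emoticon-labeled ones.
manual = {
    "pos": [["great", "day"], ["love", "this"]],
    "neg": [["bad", "day"], ["hate", "this"]],
}
emoticon = {
    "pos": [["awesome", "movie"], ["love", "it"]],
    "neg": [["awful", "movie"], ["hate", "it"]],
}
models = train(manual, emoticon, alpha=0.5)
print(classify(["awesome", "day"], models))
```

In this toy setup the word "awesome" never occurs in the manually labeled tweets, so only the emoticon-derived estimate gives it a non-zero probability. That is the kind of gap the smoothing step is meant to fill. Setting alpha to 1.0 falls back to the manually labeled data alone.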


Similar resources


Language-Independent Twitter Sentiment Analysis

Millions of tweets posted daily contain opinions and sentiment of users in a variety of languages. Sentiment classification can benefit companies by providing data for analyzing customer feedback for products or conducting market research. Sentiment classifiers need to be able to handle tweets in multiple languages to cover a larger portion of the available tweets. Traditional classifiers are h...


Twitter Sentiment Analysis

2012 CERTIFICATE It is certified that the contents and form of thesis entitled " Twitter Sentiment Analysis " submitted by Afroze Ibrahim Baqapuri (NUST-BEE-310) have been found satisfactory for the requirement of the degree.


Scaling Smoothed Language Models

In Continuous Speech Recognition (CSR) systems a Language Model (LM) is required to represent the syntactic constraints of the language. Then a smoothing technique needs to be applied to avoid null LM probabilities. Each smoothing technique leads to a different LM probability distribution. Test set perplexity is usually used to evaluate smoothing techniques but the relationship with acoustic mo...


Sentiment Analysis on Twitter

With the rise of social networking epoch, there has been a surge of user generated content. Microblogging sites have millions of people sharing their thoughts daily because of its characteristic short and simple manner of expression. We propose and investigate a paradigm to mine the sentiment from a popular real-time microblogging service, Twitter, where users post real time reactions to and op...



Journal

Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence

Year: 2021

ISSN: 2159-5399, 2374-3468

DOI: https://doi.org/10.1609/aaai.v26i1.8353